Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 50000 |
| Missing cells | 30053 |
| Missing cells (%) | 2.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.3 MiB |
| Average record size in memory | 216.0 B |
Variable types
| Categorical | 19 |
|---|---|
| Numeric | 8 |
ID has a high cardinality: 50000 distinct values | High cardinality |
Customer_ID has a high cardinality: 12500 distinct values | High cardinality |
Name has a high cardinality: 10139 distinct values | High cardinality |
Age has a high cardinality: 976 distinct values | High cardinality |
SSN has a high cardinality: 12501 distinct values | High cardinality |
Annual_Income has a high cardinality: 16121 distinct values | High cardinality |
Num_of_Loan has a high cardinality: 263 distinct values | High cardinality |
Type_of_Loan has a high cardinality: 6260 distinct values | High cardinality |
Num_of_Delayed_Payment has a high cardinality: 443 distinct values | High cardinality |
Changed_Credit_Limit has a high cardinality: 3927 distinct values | High cardinality |
Outstanding_Debt has a high cardinality: 12685 distinct values | High cardinality |
Credit_History_Age has a high cardinality: 399 distinct values | High cardinality |
Amount_invested_monthly has a high cardinality: 45450 distinct values | High cardinality |
Monthly_Balance has a high cardinality: 49433 distinct values | High cardinality |
Num_Bank_Accounts is highly correlated with Interest_Rate and 1 other fields | High correlation |
Interest_Rate is highly correlated with Num_Bank_Accounts and 2 other fields | High correlation |
Delay_from_due_date is highly correlated with Credit_Mix and 1 other fields | High correlation |
Num_Credit_Inquiries is highly correlated with Interest_Rate | High correlation |
Credit_Mix is highly correlated with Delay_from_due_date | High correlation |
Payment_of_Min_Amount is highly correlated with Delay_from_due_date | High correlation |
Name has 5015 (10.0%) missing values | Missing |
Monthly_Inhand_Salary has 7498 (15.0%) missing values | Missing |
Type_of_Loan has 5704 (11.4%) missing values | Missing |
Num_of_Delayed_Payment has 3498 (7.0%) missing values | Missing |
Num_Credit_Inquiries has 1035 (2.1%) missing values | Missing |
Credit_History_Age has 4470 (8.9%) missing values | Missing |
Amount_invested_monthly has 2271 (4.5%) missing values | Missing |
Monthly_Balance has 562 (1.1%) missing values | Missing |
ID is uniformly distributed | Uniform |
Customer_ID is uniformly distributed | Uniform |
Month is uniformly distributed | Uniform |
Annual_Income is uniformly distributed | Uniform |
Outstanding_Debt is uniformly distributed | Uniform |
Monthly_Balance is uniformly distributed | Uniform |
ID has unique values | Unique |
Credit_Utilization_Ratio has unique values | Unique |
Num_Bank_Accounts has 2166 (4.3%) zeros | Zeros |
Delay_from_due_date has 626 (1.3%) zeros | Zeros |
Num_Credit_Inquiries has 1102 (2.2%) zeros | Zeros |
Total_EMI_per_month has 5002 (10.0%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-29 10:15:14.164657 |
|---|---|
| Analysis finished | 2022-11-29 10:15:39.210009 |
| Duration | 25.05 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
| Distinct | 50000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 0x289d | 1 |
|---|---|
| 0x20487 | 1 |
| 0x23501 | 1 |
| 0x168d1 | 1 |
| 0x20186 | 1 |
| Other values (49995) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.60068 |
| Min length | 6 |
Characters and Unicode
| Total characters | 330034 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 50000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0x160a |
|---|---|
| 2nd row | 0x160b |
| 3rd row | 0x160c |
| 4th row | 0x160d |
| 5th row | 0x1616 |
Common Values
| Value | Count | Frequency (%) |
| 0x289d | 1 | < 0.1% |
| 0x20487 | 1 | < 0.1% |
| 0x23501 | 1 | < 0.1% |
| 0x168d1 | 1 | < 0.1% |
| 0x20186 | 1 | < 0.1% |
| 0x1cb58 | 1 | < 0.1% |
| 0x89a2 | 1 | < 0.1% |
| 0x161bb | 1 | < 0.1% |
| 0x17b52 | 1 | < 0.1% |
| 0x1d22f | 1 | < 0.1% |
| Other values (49990) | 49990 |
Length
| Value | Count | Frequency (%) |
| 0x289d | 1 | < 0.1% |
| 0x24c3 | 1 | < 0.1% |
| 0xe819 | 1 | < 0.1% |
| 0x172cc | 1 | < 0.1% |
| 0x24bed | 1 | < 0.1% |
| 0x5e0a | 1 | < 0.1% |
| 0x13da2 | 1 | < 0.1% |
| 0x14d71 | 1 | < 0.1% |
| 0xecbb | 1 | < 0.1% |
| 0x6d31 | 1 | < 0.1% |
| Other values (49990) | 49990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 62051 | |
| x | 50000 | |
| 1 | 34749 | |
| 2 | 21607 | 6.5% |
| 3 | 13419 | 4.1% |
| 4 | 13417 | 4.1% |
| 5 | 13415 | 4.1% |
| 9 | 12141 | 3.7% |
| c | 12141 | 3.7% |
| 8 | 12139 | 3.7% |
| Other values (7) | 84955 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 207212 | |
| Lowercase Letter | 122822 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 62051 | |
| 1 | 34749 | |
| 2 | 21607 | 10.4% |
| 3 | 13419 | 6.5% |
| 4 | 13417 | 6.5% |
| 5 | 13415 | 6.5% |
| 9 | 12141 | 5.9% |
| 8 | 12139 | 5.9% |
| 6 | 12139 | 5.9% |
| 7 | 12135 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 50000 | |
| c | 12141 | 9.9% |
| b | 12139 | 9.9% |
| e | 12139 | 9.9% |
| d | 12135 | 9.9% |
| a | 12135 | 9.9% |
| f | 12133 | 9.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 207212 | |
| Latin | 122822 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 62051 | |
| 1 | 34749 | |
| 2 | 21607 | 10.4% |
| 3 | 13419 | 6.5% |
| 4 | 13417 | 6.5% |
| 5 | 13415 | 6.5% |
| 9 | 12141 | 5.9% |
| 8 | 12139 | 5.9% |
| 6 | 12139 | 5.9% |
| 7 | 12135 | 5.9% |
Latin
| Value | Count | Frequency (%) |
| x | 50000 | |
| c | 12141 | 9.9% |
| b | 12139 | 9.9% |
| e | 12139 | 9.9% |
| d | 12135 | 9.9% |
| a | 12135 | 9.9% |
| f | 12133 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 330034 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 62051 | |
| x | 50000 | |
| 1 | 34749 | |
| 2 | 21607 | 6.5% |
| 3 | 13419 | 4.1% |
| 4 | 13417 | 4.1% |
| 5 | 13415 | 4.1% |
| 9 | 12141 | 3.7% |
| c | 12141 | 3.7% |
| 8 | 12139 | 3.7% |
| Other values (7) | 84955 |
| Distinct | 12500 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| CUS_0x2425 | 4 |
|---|---|
| CUS_0x12cb | 4 |
| CUS_0x6df3 | 4 |
| CUS_0xb56b | 4 |
| CUS_0x3032 | 4 |
| Other values (12495) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.93952 |
| Min length | 9 |
Characters and Unicode
| Total characters | 496976 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CUS_0xd40 |
|---|---|
| 2nd row | CUS_0xd40 |
| 3rd row | CUS_0xd40 |
| 4th row | CUS_0xd40 |
| 5th row | CUS_0x21b1 |
Common Values
| Value | Count | Frequency (%) |
| CUS_0x2425 | 4 | < 0.1% |
| CUS_0x12cb | 4 | < 0.1% |
| CUS_0x6df3 | 4 | < 0.1% |
| CUS_0xb56b | 4 | < 0.1% |
| CUS_0x3032 | 4 | < 0.1% |
| CUS_0x3870 | 4 | < 0.1% |
| CUS_0x1bdf | 4 | < 0.1% |
| CUS_0x1fdc | 4 | < 0.1% |
| CUS_0x24cf | 4 | < 0.1% |
| CUS_0x9192 | 4 | < 0.1% |
| Other values (12490) | 49960 |
Length
| Value | Count | Frequency (%) |
| cus_0x2425 | 4 | < 0.1% |
| cus_0x2499 | 4 | < 0.1% |
| cus_0x4d03 | 4 | < 0.1% |
| cus_0x3c6a | 4 | < 0.1% |
| cus_0x143c | 4 | < 0.1% |
| cus_0x2a69 | 4 | < 0.1% |
| cus_0x4797 | 4 | < 0.1% |
| cus_0x3d50 | 4 | < 0.1% |
| cus_0x3fb | 4 | < 0.1% |
| cus_0x997e | 4 | < 0.1% |
| Other values (12490) | 49960 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 59124 | |
| C | 50000 | 10.1% |
| S | 50000 | 10.1% |
| _ | 50000 | 10.1% |
| x | 50000 | 10.1% |
| U | 50000 | 10.1% |
| 4 | 14000 | 2.8% |
| 6 | 13700 | 2.8% |
| 5 | 13600 | 2.7% |
| 3 | 13536 | 2.7% |
| Other values (11) | 133016 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 180604 | |
| Uppercase Letter | 150000 | |
| Lowercase Letter | 116372 | |
| Connector Punctuation | 50000 | 10.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 59124 | |
| 4 | 14000 | 7.8% |
| 6 | 13700 | 7.6% |
| 5 | 13600 | 7.5% |
| 3 | 13536 | 7.5% |
| 8 | 13536 | 7.5% |
| 7 | 13388 | 7.4% |
| 9 | 13368 | 7.4% |
| 2 | 13360 | 7.4% |
| 1 | 12992 | 7.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 50000 | |
| b | 13400 | 11.5% |
| a | 13272 | 11.4% |
| c | 11136 | 9.6% |
| e | 9744 | 8.4% |
| d | 9436 | 8.1% |
| f | 9384 | 8.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 50000 | |
| S | 50000 | |
| U | 50000 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 50000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 266372 | |
| Common | 230604 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 59124 | |
| _ | 50000 | |
| 4 | 14000 | 6.1% |
| 6 | 13700 | 5.9% |
| 5 | 13600 | 5.9% |
| 3 | 13536 | 5.9% |
| 8 | 13536 | 5.9% |
| 7 | 13388 | 5.8% |
| 9 | 13368 | 5.8% |
| 2 | 13360 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| C | 50000 | |
| S | 50000 | |
| x | 50000 | |
| U | 50000 | |
| b | 13400 | 5.0% |
| a | 13272 | 5.0% |
| c | 11136 | 4.2% |
| e | 9744 | 3.7% |
| d | 9436 | 3.5% |
| f | 9384 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 496976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 59124 | |
| C | 50000 | 10.1% |
| S | 50000 | 10.1% |
| _ | 50000 | 10.1% |
| x | 50000 | 10.1% |
| U | 50000 | 10.1% |
| 4 | 14000 | 2.8% |
| 6 | 13700 | 2.8% |
| 5 | 13600 | 2.7% |
| 3 | 13536 | 2.7% |
| Other values (11) | 133016 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| September | |
|---|---|
| October | |
| December | |
| November |
Length
| Max length | 9 |
|---|---|
| Median length | 8.5 |
| Mean length | 8 |
| Min length | 7 |
Characters and Unicode
| Total characters | 400000 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | September |
|---|---|
| 2nd row | October |
| 3rd row | November |
| 4th row | December |
| 5th row | September |
Common Values
| Value | Count | Frequency (%) |
| September | 12500 | |
| October | 12500 | |
| December | 12500 | |
| November | 12500 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| september | 12500 | |
| october | 12500 | |
| december | 12500 | |
| november | 12500 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 112500 | |
| b | 50000 | |
| r | 50000 | |
| m | 37500 | 9.4% |
| t | 25000 | 6.2% |
| c | 25000 | 6.2% |
| o | 25000 | 6.2% |
| S | 12500 | 3.1% |
| p | 12500 | 3.1% |
| O | 12500 | 3.1% |
| Other values (3) | 37500 | 9.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 350000 | |
| Uppercase Letter | 50000 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 112500 | |
| b | 50000 | |
| r | 50000 | |
| m | 37500 | 10.7% |
| t | 25000 | 7.1% |
| c | 25000 | 7.1% |
| o | 25000 | 7.1% |
| p | 12500 | 3.6% |
| v | 12500 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 12500 | |
| O | 12500 | |
| D | 12500 | |
| N | 12500 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 400000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 112500 | |
| b | 50000 | |
| r | 50000 | |
| m | 37500 | 9.4% |
| t | 25000 | 6.2% |
| c | 25000 | 6.2% |
| o | 25000 | 6.2% |
| S | 12500 | 3.1% |
| p | 12500 | 3.1% |
| O | 12500 | 3.1% |
| Other values (3) | 37500 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 400000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 112500 | |
| b | 50000 | |
| r | 50000 | |
| m | 37500 | 9.4% |
| t | 25000 | 6.2% |
| c | 25000 | 6.2% |
| o | 25000 | 6.2% |
| S | 12500 | 3.1% |
| p | 12500 | 3.1% |
| O | 12500 | 3.1% |
| Other values (3) | 37500 | 9.4% |
| Distinct | 10139 |
|---|---|
| Distinct (%) | 22.5% |
| Missing | 5015 |
| Missing (%) | 10.0% |
| Memory size | 390.8 KiB |
| Stevex | 22 |
|---|---|
| Langep | 21 |
| Deepa Seetharamanm | 20 |
| Ronald Groverk | 20 |
| Jessicad | 20 |
| Other values (10134) |
Length
| Max length | 25 |
|---|---|
| Median length | 20 |
| Mean length | 9.758274981 |
| Min length | 2 |
Characters and Unicode
| Total characters | 438976 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 27 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Aaron Maashoh |
|---|---|
| 2nd row | Aaron Maashoh |
| 3rd row | Aaron Maashoh |
| 4th row | Aaron Maashoh |
| 5th row | Rick Rothackerj |
Common Values
| Value | Count | Frequency (%) |
| Stevex | 22 | < 0.1% |
| Langep | 21 | < 0.1% |
| Deepa Seetharamanm | 20 | < 0.1% |
| Ronald Groverk | 20 | < 0.1% |
| Jessicad | 20 | < 0.1% |
| Raymondr | 20 | < 0.1% |
| Nicko | 20 | < 0.1% |
| Vaughanl | 19 | < 0.1% |
| Jonesb | 19 | < 0.1% |
| Jessica Wohlt | 19 | < 0.1% |
| Other values (10129) | 44785 | |
| (Missing) | 5015 | 10.0% |
Length
| Value | Count | Frequency (%) |
| david | 328 | 0.5% |
| jonathan | 300 | 0.5% |
| jessica | 256 | 0.4% |
| sarah | 212 | 0.3% |
| karen | 190 | 0.3% |
| nick | 184 | 0.3% |
| tim | 184 | 0.3% |
| caroline | 181 | 0.3% |
| tom | 174 | 0.3% |
| john | 169 | 0.3% |
| Other values (9720) | 60616 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 45799 | 10.4% |
| e | 38101 | 8.7% |
| n | 29539 | 6.7% |
| i | 29158 | 6.6% |
| r | 27144 | 6.2% |
| o | 22261 | 5.1% |
| l | 21086 | 4.8% |
| 17825 | 4.1% | |
| t | 17446 | 4.0% |
| h | 15228 | 3.5% |
| Other values (47) | 175389 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 356743 | |
| Uppercase Letter | 62618 | 14.3% |
| Space Separator | 17825 | 4.1% |
| Other Punctuation | 1075 | 0.2% |
| Dash Punctuation | 715 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 45799 | |
| e | 38101 | 10.7% |
| n | 29539 | 8.3% |
| i | 29158 | 8.2% |
| r | 27144 | 7.6% |
| o | 22261 | 6.2% |
| l | 21086 | 5.9% |
| t | 17446 | 4.9% |
| h | 15228 | 4.3% |
| s | 15226 | 4.3% |
| Other values (16) | 95755 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 7100 | 11.3% |
| A | 4370 | 7.0% |
| M | 4314 | 6.9% |
| L | 4206 | 6.7% |
| J | 4014 | 6.4% |
| C | 3883 | 6.2% |
| R | 3587 | 5.7% |
| D | 3511 | 5.6% |
| K | 3447 | 5.5% |
| B | 3251 | 5.2% |
| Other values (16) | 20935 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 551 | |
| " | 478 | |
| , | 46 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 17825 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 715 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 419361 | |
| Common | 19615 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 45799 | 10.9% |
| e | 38101 | 9.1% |
| n | 29539 | 7.0% |
| i | 29158 | 7.0% |
| r | 27144 | 6.5% |
| o | 22261 | 5.3% |
| l | 21086 | 5.0% |
| t | 17446 | 4.2% |
| h | 15228 | 3.6% |
| s | 15226 | 3.6% |
| Other values (42) | 158373 |
Common
| Value | Count | Frequency (%) |
| 17825 | ||
| - | 715 | 3.6% |
| . | 551 | 2.8% |
| " | 478 | 2.4% |
| , | 46 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 438976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 45799 | 10.4% |
| e | 38101 | 8.7% |
| n | 29539 | 6.7% |
| i | 29158 | 6.6% |
| r | 27144 | 6.2% |
| o | 22261 | 5.1% |
| l | 21086 | 4.8% |
| 17825 | 4.1% | |
| t | 17446 | 4.0% |
| h | 15228 | 3.5% |
| Other values (47) | 175389 |
| Distinct | 976 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 39 | 1493 |
|---|---|
| 32 | 1440 |
| 44 | 1428 |
| 22 | 1422 |
| 35 | 1414 |
| Other values (971) |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.10342 |
| Min length | 2 |
Characters and Unicode
| Total characters | 105171 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 839 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | 23 |
|---|---|
| 2nd row | 24 |
| 3rd row | 24 |
| 4th row | 24_ |
| 5th row | 28 |
Common Values
| Value | Count | Frequency (%) |
| 39 | 1493 | 3.0% |
| 32 | 1440 | 2.9% |
| 44 | 1428 | 2.9% |
| 22 | 1422 | 2.8% |
| 35 | 1414 | 2.8% |
| 37 | 1397 | 2.8% |
| 27 | 1382 | 2.8% |
| 20 | 1374 | 2.7% |
| 29 | 1368 | 2.7% |
| 26 | 1348 | 2.7% |
| Other values (966) | 35934 |
Length
| Value | Count | Frequency (%) |
| 39 | 1570 | 3.1% |
| 32 | 1529 | 3.1% |
| 44 | 1500 | 3.0% |
| 22 | 1493 | 3.0% |
| 35 | 1483 | 3.0% |
| 37 | 1461 | 2.9% |
| 27 | 1457 | 2.9% |
| 29 | 1441 | 2.9% |
| 20 | 1432 | 2.9% |
| 26 | 1421 | 2.8% |
| Other values (918) | 35213 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 19446 | |
| 3 | 19144 | |
| 4 | 16616 | |
| 5 | 10984 | |
| 1 | 9785 | |
| 0 | 5994 | 5.7% |
| 6 | 5690 | 5.4% |
| 9 | 5305 | 5.0% |
| 7 | 4704 | 4.5% |
| 8 | 4562 | 4.3% |
| Other values (2) | 2941 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 102230 | |
| Connector Punctuation | 2477 | 2.4% |
| Dash Punctuation | 464 | 0.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 19446 | |
| 3 | 19144 | |
| 4 | 16616 | |
| 5 | 10984 | |
| 1 | 9785 | |
| 0 | 5994 | 5.9% |
| 6 | 5690 | 5.6% |
| 9 | 5305 | 5.2% |
| 7 | 4704 | 4.6% |
| 8 | 4562 | 4.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2477 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 464 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 105171 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 19446 | |
| 3 | 19144 | |
| 4 | 16616 | |
| 5 | 10984 | |
| 1 | 9785 | |
| 0 | 5994 | 5.7% |
| 6 | 5690 | 5.4% |
| 9 | 5305 | 5.0% |
| 7 | 4704 | 4.5% |
| 8 | 4562 | 4.3% |
| Other values (2) | 2941 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 105171 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 19446 | |
| 3 | 19144 | |
| 4 | 16616 | |
| 5 | 10984 | |
| 1 | 9785 | |
| 0 | 5994 | 5.7% |
| 6 | 5690 | 5.4% |
| 9 | 5305 | 5.0% |
| 7 | 4704 | 4.5% |
| 8 | 4562 | 4.3% |
| Other values (2) | 2941 | 2.8% |
| Distinct | 12501 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| #F%$D@*&8 | 2828 |
|---|---|
| 605-63-9678 | 4 |
| 500-54-3583 | 4 |
| 027-69-5774 | 4 |
| 716-63-9191 | 4 |
| Other values (12496) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.88688 |
| Min length | 9 |
Characters and Unicode
| Total characters | 544344 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 821-00-0265 |
|---|---|
| 2nd row | 821-00-0265 |
| 3rd row | 821-00-0265 |
| 4th row | 821-00-0265 |
| 5th row | 004-07-5839 |
Common Values
| Value | Count | Frequency (%) |
| #F%$D@*&8 | 2828 | 5.7% |
| 605-63-9678 | 4 | < 0.1% |
| 500-54-3583 | 4 | < 0.1% |
| 027-69-5774 | 4 | < 0.1% |
| 716-63-9191 | 4 | < 0.1% |
| 423-96-9115 | 4 | < 0.1% |
| 885-50-0108 | 4 | < 0.1% |
| 268-75-5454 | 4 | < 0.1% |
| 226-58-1620 | 4 | < 0.1% |
| 973-70-9064 | 4 | < 0.1% |
| Other values (12491) | 47136 |
Length
| Value | Count | Frequency (%) |
| f%$d@*&8 | 2828 | 5.7% |
| 999-28-9483 | 4 | < 0.1% |
| 249-63-8554 | 4 | < 0.1% |
| 103-20-7616 | 4 | < 0.1% |
| 956-23-3860 | 4 | < 0.1% |
| 350-85-7603 | 4 | < 0.1% |
| 989-07-3770 | 4 | < 0.1% |
| 300-57-8786 | 4 | < 0.1% |
| 605-53-6907 | 4 | < 0.1% |
| 187-58-8843 | 4 | < 0.1% |
| Other values (12491) | 47136 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 94344 | |
| 8 | 45742 | |
| 1 | 43235 | |
| 4 | 42874 | |
| 2 | 42687 | |
| 7 | 42591 | |
| 9 | 42261 | |
| 0 | 42219 | |
| 5 | 42183 | |
| 3 | 41862 | |
| Other values (9) | 64346 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 427376 | |
| Dash Punctuation | 94344 | 17.3% |
| Other Punctuation | 14140 | 2.6% |
| Uppercase Letter | 5656 | 1.0% |
| Currency Symbol | 2828 | 0.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 45742 | |
| 1 | 43235 | |
| 4 | 42874 | |
| 2 | 42687 | |
| 7 | 42591 | |
| 9 | 42261 | |
| 0 | 42219 | |
| 5 | 42183 | |
| 3 | 41862 | |
| 6 | 41722 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 2828 | |
| * | 2828 | |
| @ | 2828 | |
| % | 2828 | |
| # | 2828 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 2828 | |
| D | 2828 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 94344 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 2828 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 538688 | |
| Latin | 5656 | 1.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 94344 | |
| 8 | 45742 | |
| 1 | 43235 | |
| 4 | 42874 | |
| 2 | 42687 | |
| 7 | 42591 | |
| 9 | 42261 | |
| 0 | 42219 | |
| 5 | 42183 | |
| 3 | 41862 | |
| Other values (7) | 58690 |
Latin
| Value | Count | Frequency (%) |
| F | 2828 | |
| D | 2828 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 544344 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 94344 | |
| 8 | 45742 | |
| 1 | 43235 | |
| 4 | 42874 | |
| 2 | 42687 | |
| 7 | 42591 | |
| 9 | 42261 | |
| 0 | 42219 | |
| 5 | 42183 | |
| 3 | 41862 | |
| Other values (9) | 64346 |
Occupation
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| _______ | |
|---|---|
| Lawyer | 3324 |
| Engineer | 3212 |
| Architect | 3195 |
| Mechanic | 3168 |
| Other values (11) |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.43476 |
| Min length | 6 |
Characters and Unicode
| Total characters | 421738 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Scientist |
|---|---|
| 2nd row | Scientist |
| 3rd row | Scientist |
| 4th row | Scientist |
| 5th row | _______ |
Common Values
| Value | Count | Frequency (%) |
| _______ | 3438 | 6.9% |
| Lawyer | 3324 | 6.6% |
| Engineer | 3212 | 6.4% |
| Architect | 3195 | 6.4% |
| Mechanic | 3168 | 6.3% |
| Developer | 3146 | 6.3% |
| Accountant | 3133 | 6.3% |
| Media_Manager | 3130 | 6.3% |
| Scientist | 3104 | 6.2% |
| Teacher | 3103 | 6.2% |
| Other values (6) | 18047 |
Length
| Value | Count | Frequency (%) |
| 3438 | 6.9% | |
| lawyer | 3324 | 6.6% |
| engineer | 3212 | 6.4% |
| architect | 3195 | 6.4% |
| mechanic | 3168 | 6.3% |
| developer | 3146 | 6.3% |
| accountant | 3133 | 6.3% |
| media_manager | 3130 | 6.3% |
| scientist | 3104 | 6.2% |
| teacher | 3103 | 6.2% |
| Other values (6) | 18047 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 56361 | |
| r | 43349 | |
| n | 37282 | 8.8% |
| a | 34102 | 8.1% |
| c | 31173 | 7.4% |
| t | 30964 | 7.3% |
| i | 30777 | 7.3% |
| _ | 27196 | 6.4% |
| M | 15375 | 3.6% |
| o | 15370 | 3.6% |
| Other values (18) | 99789 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 344850 | |
| Uppercase Letter | 49692 | 11.8% |
| Connector Punctuation | 27196 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 56361 | |
| r | 43349 | |
| n | 37282 | |
| a | 34102 | |
| c | 31173 | |
| t | 30964 | |
| i | 30777 | |
| o | 15370 | 4.5% |
| u | 12220 | 3.5% |
| h | 9466 | 2.7% |
| Other values (8) | 43786 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 15375 | |
| A | 6328 | |
| E | 6315 | |
| D | 6173 | |
| L | 3324 | 6.7% |
| S | 3104 | 6.2% |
| T | 3103 | 6.2% |
| J | 3037 | 6.1% |
| W | 2933 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 27196 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 394542 | |
| Common | 27196 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 56361 | |
| r | 43349 | |
| n | 37282 | |
| a | 34102 | 8.6% |
| c | 31173 | 7.9% |
| t | 30964 | 7.8% |
| i | 30777 | 7.8% |
| M | 15375 | 3.9% |
| o | 15370 | 3.9% |
| u | 12220 | 3.1% |
| Other values (17) | 87569 |
Common
| Value | Count | Frequency (%) |
| _ | 27196 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 421738 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 56361 | |
| r | 43349 | |
| n | 37282 | 8.8% |
| a | 34102 | 8.1% |
| c | 31173 | 7.4% |
| t | 30964 | 7.3% |
| i | 30777 | 7.3% |
| _ | 27196 | 6.4% |
| M | 15375 | 3.6% |
| o | 15370 | 3.6% |
| Other values (18) | 99789 |
| Distinct | 16121 |
|---|---|
| Distinct (%) | 32.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 9141.63 | 8 |
|---|---|
| 95596.35 | 8 |
| 72524.2 | 8 |
| 36585.12 | 8 |
| 22434.16 | 8 |
| Other values (16116) |
Length
| Max length | 19 |
|---|---|
| Median length | 8 |
| Mean length | 8.3094 |
| Min length | 6 |
Characters and Unicode
| Total characters | 415470 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3335 ? |
|---|---|
| Unique (%) | 6.7% |
Sample
| 1st row | 19114.12 |
|---|---|
| 2nd row | 19114.12 |
| 3rd row | 19114.12 |
| 4th row | 19114.12 |
| 5th row | 34847.84 |
Common Values
| Value | Count | Frequency (%) |
| 9141.63 | 8 | < 0.1% |
| 95596.35 | 8 | < 0.1% |
| 72524.2 | 8 | < 0.1% |
| 36585.12 | 8 | < 0.1% |
| 22434.16 | 8 | < 0.1% |
| 17816.75 | 8 | < 0.1% |
| 109945.32 | 8 | < 0.1% |
| 20867.67 | 7 | < 0.1% |
| 40341.16 | 7 | < 0.1% |
| 33029.66 | 7 | < 0.1% |
| Other values (16111) | 49923 |
Length
| Value | Count | Frequency (%) |
| 9141.63 | 8 | < 0.1% |
| 20867.67 | 8 | < 0.1% |
| 95596.35 | 8 | < 0.1% |
| 32543.38 | 8 | < 0.1% |
| 33029.66 | 8 | < 0.1% |
| 40341.16 | 8 | < 0.1% |
| 17273.83 | 8 | < 0.1% |
| 109945.32 | 8 | < 0.1% |
| 17816.75 | 8 | < 0.1% |
| 22434.16 | 8 | < 0.1% |
| Other values (12979) | 49920 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 50000 | |
| 1 | 46376 | |
| 2 | 38186 | |
| 4 | 35878 | |
| 3 | 35800 | |
| 8 | 35462 | |
| 5 | 35357 | |
| 6 | 35326 | |
| 9 | 34524 | |
| 0 | 33173 | |
| Other values (2) | 35388 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 361950 | |
| Other Punctuation | 50000 | 12.0% |
| Connector Punctuation | 3520 | 0.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 46376 | |
| 2 | 38186 | |
| 4 | 35878 | |
| 3 | 35800 | |
| 8 | 35462 | |
| 5 | 35357 | |
| 6 | 35326 | |
| 9 | 34524 | |
| 0 | 33173 | |
| 7 | 31868 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 50000 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3520 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 415470 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 50000 | |
| 1 | 46376 | |
| 2 | 38186 | |
| 4 | 35878 | |
| 3 | 35800 | |
| 8 | 35462 | |
| 5 | 35357 | |
| 6 | 35326 | |
| 9 | 34524 | |
| 0 | 33173 | |
| Other values (2) | 35388 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 415470 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 50000 | |
| 1 | 46376 | |
| 2 | 38186 | |
| 4 | 35878 | |
| 3 | 35800 | |
| 8 | 35462 | |
| 5 | 35357 | |
| 6 | 35326 | |
| 9 | 34524 | |
| 0 | 33173 | |
| Other values (2) | 35388 |
| Distinct | 12793 |
|---|---|
| Distinct (%) | 30.1% |
| Missing | 7498 |
| Missing (%) | 15.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4182.004291 |
| Minimum | 303.6454167 |
|---|---|
| Maximum | 15204.63333 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 303.6454167 |
|---|---|
| 5-th percentile | 833.7731458 |
| Q1 | 1625.188333 |
| median | 3086.305 |
| Q3 | 5934.189094 |
| 95-th percentile | 10771.23333 |
| Maximum | 15204.63333 |
| Range | 14900.98792 |
| Interquartile range (IQR) | 4309.00076 |
Descriptive statistics
| Standard deviation | 3174.109304 |
|---|---|
| Coefficient of variation (CV) | 0.7589923594 |
| Kurtosis | 0.6280610902 |
| Mean | 4182.004291 |
| Median Absolute Deviation (MAD) | 1750.829583 |
| Skewness | 1.131373974 |
| Sum | 177743546.4 |
| Variance | 10074969.87 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1315.560833 | 8 | < 0.1% |
| 4387.2725 | 7 | < 0.1% |
| 3080.555 | 7 | < 0.1% |
| 5766.491667 | 7 | < 0.1% |
| 6082.1875 | 7 | < 0.1% |
| 2295.058333 | 7 | < 0.1% |
| 6639.56 | 7 | < 0.1% |
| 536.43125 | 7 | < 0.1% |
| 6358.956667 | 6 | < 0.1% |
| 10511.33 | 4 | < 0.1% |
| Other values (12783) | 42435 | |
| (Missing) | 7498 | 15.0% |
| Value | Count | Frequency (%) |
| 303.6454167 | 2 | |
| 319.55625 | 4 | |
| 331.0319233 | 2 | |
| 332.1283333 | 3 | |
| 332.43125 | 4 | |
| 333.5966667 | 4 | |
| 355.2083333 | 4 | |
| 357.2558333 | 4 | |
| 358.0583333 | 4 | |
| 361.6033333 | 4 |
| Value | Count | Frequency (%) |
| 15204.63333 | 3 | |
| 15167.18 | 4 | |
| 15136.69667 | 3 | |
| 15115.19 | 3 | |
| 15101.94 | 3 | |
| 15090.07667 | 4 | |
| 15066.78333 | 4 | |
| 14978.33667 | 3 | |
| 14960.25 | 1 | < 0.1% |
| 14929.54 | 3 |
| Distinct | 540 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.83826 |
| Minimum | -1 |
|---|---|
| Maximum | 1798 |
| Zeros | 2166 |
| Zeros (%) | 4.3% |
| Negative | 16 |
| Negative (%) | < 0.1% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 1798 |
| Range | 1799 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 116.3968482 |
|---|---|
| Coefficient of variation (CV) | 6.912641103 |
| Kurtosis | 132.919184 |
| Mean | 16.83826 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 11.25168183 |
| Sum | 841913 |
| Variance | 13548.22626 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 6504 | |
| 7 | 6408 | |
| 8 | 6387 | |
| 4 | 6100 | |
| 5 | 6068 | |
| 3 | 5955 | |
| 9 | 2738 | |
| 10 | 2599 | 5.2% |
| 1 | 2253 | 4.5% |
| 0 | 2166 | 4.3% |
| Other values (530) | 2822 |
| Value | Count | Frequency (%) |
| -1 | 16 | < 0.1% |
| 0 | 2166 | 4.3% |
| 1 | 2253 | 4.5% |
| 2 | 2152 | 4.3% |
| 3 | 5955 | |
| 4 | 6100 | |
| 5 | 6068 | |
| 6 | 6504 | |
| 7 | 6408 | |
| 8 | 6387 |
| Value | Count | Frequency (%) |
| 1798 | 1 | |
| 1783 | 1 | |
| 1781 | 1 | |
| 1780 | 1 | |
| 1775 | 1 | |
| 1774 | 2 | |
| 1773 | 1 | |
| 1772 | 1 | |
| 1771 | 1 | |
| 1770 | 1 |
Num_Credit_Card
Real number (ℝ≥0)
| Distinct | 819 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.92148 |
| Minimum | 0 |
|---|---|
| Maximum | 1499 |
| Zeros | 16 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 1499 |
| Range | 1499 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 129.3148043 |
|---|---|
| Coefficient of variation (CV) | 5.641642872 |
| Kurtosis | 71.87065897 |
| Mean | 22.92148 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 8.286879673 |
| Sum | 1146074 |
| Variance | 16722.3186 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 9210 | |
| 7 | 8271 | |
| 6 | 8243 | |
| 4 | 7072 | |
| 3 | 6539 | |
| 8 | 2497 | 5.0% |
| 10 | 2405 | 4.8% |
| 9 | 2333 | 4.7% |
| 2 | 1131 | 2.3% |
| 1 | 1063 | 2.1% |
| Other values (809) | 1236 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 16 | < 0.1% |
| 1 | 1063 | 2.1% |
| 2 | 1131 | 2.3% |
| 3 | 6539 | |
| 4 | 7072 | |
| 5 | 9210 | |
| 6 | 8243 | |
| 7 | 8271 | |
| 8 | 2497 | 5.0% |
| 9 | 2333 | 4.7% |
| Value | Count | Frequency (%) |
| 1499 | 1 | < 0.1% |
| 1498 | 2 | |
| 1495 | 1 | < 0.1% |
| 1491 | 1 | < 0.1% |
| 1488 | 1 | < 0.1% |
| 1486 | 1 | < 0.1% |
| 1485 | 1 | < 0.1% |
| 1484 | 2 | |
| 1481 | 2 | |
| 1474 | 3 |
| Distinct | 945 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.77264 |
| Minimum | 1 |
|---|---|
| Maximum | 5799 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 32 |
| Maximum | 5799 |
| Range | 5798 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 451.6023629 |
|---|---|
| Coefficient of variation (CV) | 6.566599202 |
| Kurtosis | 92.48656451 |
| Mean | 68.77264 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 9.370223147 |
| Sum | 3438632 |
| Variance | 203944.6942 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 2503 | 5.0% |
| 5 | 2500 | 5.0% |
| 6 | 2368 | 4.7% |
| 12 | 2288 | 4.6% |
| 10 | 2259 | 4.5% |
| 9 | 2253 | 4.5% |
| 7 | 2250 | 4.5% |
| 11 | 2198 | 4.4% |
| 18 | 2052 | 4.1% |
| 15 | 1992 | 4.0% |
| Other values (935) | 27337 |
| Value | Count | Frequency (%) |
| 1 | 1344 | |
| 2 | 1245 | |
| 3 | 1388 | |
| 4 | 1287 | |
| 5 | 2500 | |
| 6 | 2368 | |
| 7 | 2250 | |
| 8 | 2503 | |
| 9 | 2253 | |
| 10 | 2259 |
| Value | Count | Frequency (%) |
| 5799 | 1 | |
| 5792 | 1 | |
| 5773 | 1 | |
| 5759 | 2 | |
| 5752 | 1 | |
| 5748 | 1 | |
| 5747 | 1 | |
| 5743 | 1 | |
| 5736 | 1 | |
| 5732 | 1 |
| Distinct | 263 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 2 | |
|---|---|
| 3 | |
| 4 | |
| 0 | |
| 1 | |
| Other values (258) |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.17906 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58953 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 226 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 7173 | |
| 3 | 7114 | |
| 4 | 6982 | |
| 0 | 5163 | |
| 1 | 5029 | |
| 6 | 3707 | |
| 7 | 3483 | |
| 5 | 3437 | |
| -100 | 1974 | 3.9% |
| 9 | 1746 | 3.5% |
| Other values (253) | 4192 |
Length
| Value | Count | Frequency (%) |
| 2 | 7515 | |
| 3 | 7514 | |
| 4 | 7368 | |
| 0 | 5446 | |
| 1 | 5295 | |
| 6 | 3902 | |
| 7 | 3680 | |
| 5 | 3617 | |
| 100 | 1974 | 3.9% |
| 9 | 1837 | 3.7% |
| Other values (242) | 1852 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9458 | |
| 2 | 7597 | |
| 3 | 7597 | |
| 4 | 7476 | |
| 1 | 7438 | |
| 6 | 3976 | |
| 7 | 3742 | 6.3% |
| 5 | 3696 | 6.3% |
| _ | 2436 | 4.1% |
| - | 1974 | 3.3% |
| Other values (2) | 3563 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 54543 | |
| Connector Punctuation | 2436 | 4.1% |
| Dash Punctuation | 1974 | 3.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9458 | |
| 2 | 7597 | |
| 3 | 7597 | |
| 4 | 7476 | |
| 1 | 7438 | |
| 6 | 3976 | |
| 7 | 3742 | 6.9% |
| 5 | 3696 | 6.8% |
| 9 | 1908 | 3.5% |
| 8 | 1655 | 3.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2436 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1974 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 58953 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9458 | |
| 2 | 7597 | |
| 3 | 7597 | |
| 4 | 7476 | |
| 1 | 7438 | |
| 6 | 3976 | |
| 7 | 3742 | 6.3% |
| 5 | 3696 | 6.3% |
| _ | 2436 | 4.1% |
| - | 1974 | 3.3% |
| Other values (2) | 3563 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9458 | |
| 2 | 7597 | |
| 3 | 7597 | |
| 4 | 7476 | |
| 1 | 7438 | |
| 6 | 3976 | |
| 7 | 3742 | 6.3% |
| 5 | 3696 | 6.3% |
| _ | 2436 | 4.1% |
| - | 1974 | 3.3% |
| Other values (2) | 3563 | 6.0% |
| Distinct | 6260 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 5704 |
| Missing (%) | 11.4% |
| Memory size | 390.8 KiB |
| Not Specified | 704 |
|---|---|
| Credit-Builder Loan | 640 |
| Personal Loan | 636 |
| Debt Consolidation Loan | 632 |
| Student Loan | 620 |
| Other values (6255) |
Length
| Max length | 182 |
|---|---|
| Median length | 142 |
| Mean length | 66.68358317 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2953816 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
|---|---|
| 2nd row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
| 3rd row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
| 4th row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
| 5th row | Credit-Builder Loan |
Common Values
| Value | Count | Frequency (%) |
| Not Specified | 704 | 1.4% |
| Credit-Builder Loan | 640 | 1.3% |
| Personal Loan | 636 | 1.3% |
| Debt Consolidation Loan | 632 | 1.3% |
| Student Loan | 620 | 1.2% |
| Payday Loan | 600 | 1.2% |
| Mortgage Loan | 588 | 1.2% |
| Auto Loan | 576 | 1.2% |
| Home Equity Loan | 568 | 1.1% |
| Personal Loan, and Student Loan | 160 | 0.3% |
| Other values (6250) | 38572 | |
| (Missing) | 5704 | 11.4% |
Length
| Value | Count | Frequency (%) |
| loan | 156836 | |
| and | 38732 | 9.0% |
| payday | 20284 | 4.7% |
| credit-builder | 20220 | 4.7% |
| not | 19808 | 4.6% |
| specified | 19808 | 4.6% |
| home | 19552 | 4.5% |
| equity | 19552 | 4.5% |
| student | 19484 | 4.5% |
| mortgage | 19468 | 4.5% |
| Other values (4) | 77216 |
Most occurring characters
| Value | Count | Frequency (%) |
| 386664 | ||
| o | 312268 | |
| a | 294436 | 10.0% |
| n | 273272 | 9.3% |
| e | 177392 | 6.0% |
| t | 175788 | 6.0% |
| d | 158136 | 5.4% |
| L | 156836 | 5.3% |
| i | 138384 | 4.7% |
| , | 132348 | 4.5% |
| Other values (23) | 748292 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2002136 | |
| Uppercase Letter | 412448 | 14.0% |
| Space Separator | 386664 | 13.1% |
| Other Punctuation | 132348 | 4.5% |
| Dash Punctuation | 20220 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 312268 | |
| a | 294436 | |
| n | 273272 | |
| e | 177392 | |
| t | 175788 | |
| d | 158136 | |
| i | 138384 | |
| r | 79352 | 4.0% |
| u | 78252 | 3.9% |
| y | 60120 | 3.0% |
| Other values (9) | 254736 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 156836 | |
| P | 39728 | 9.6% |
| C | 39608 | 9.6% |
| S | 39292 | 9.5% |
| B | 20220 | 4.9% |
| N | 19808 | 4.8% |
| H | 19552 | 4.7% |
| E | 19552 | 4.7% |
| M | 19468 | 4.7% |
| D | 19388 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| 386664 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 132348 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20220 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2414584 | |
| Common | 539232 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 312268 | |
| a | 294436 | |
| n | 273272 | |
| e | 177392 | 7.3% |
| t | 175788 | 7.3% |
| d | 158136 | 6.5% |
| L | 156836 | 6.5% |
| i | 138384 | 5.7% |
| r | 79352 | 3.3% |
| u | 78252 | 3.2% |
| Other values (20) | 570468 |
Common
| Value | Count | Frequency (%) |
| 386664 | ||
| , | 132348 | 24.5% |
| - | 20220 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2953816 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 386664 | ||
| o | 312268 | |
| a | 294436 | 10.0% |
| n | 273272 | 9.3% |
| e | 177392 | 6.0% |
| t | 175788 | 6.0% |
| d | 158136 | 5.4% |
| L | 156836 | 5.3% |
| i | 138384 | 4.7% |
| , | 132348 | 4.5% |
| Other values (23) | 748292 |
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.05264 |
| Minimum | -5 |
|---|---|
| Maximum | 67 |
| Zeros | 626 |
| Zeros (%) | 1.3% |
| Negative | 298 |
| Negative (%) | 0.6% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | -5 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 10 |
| median | 18 |
| Q3 | 28 |
| 95-th percentile | 54 |
| Maximum | 67 |
| Range | 72 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.86039722 |
|---|---|
| Coefficient of variation (CV) | 0.7058685858 |
| Kurtosis | 0.3444273989 |
| Mean | 21.05264 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.9649281101 |
| Sum | 1052632 |
| Variance | 220.8314057 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 1761 | 3.5% |
| 15 | 1759 | 3.5% |
| 8 | 1680 | 3.4% |
| 9 | 1656 | 3.3% |
| 10 | 1645 | 3.3% |
| 14 | 1636 | 3.3% |
| 12 | 1625 | 3.2% |
| 7 | 1587 | 3.2% |
| 6 | 1584 | 3.2% |
| 11 | 1573 | 3.1% |
| Other values (63) | 33494 |
| Value | Count | Frequency (%) |
| -5 | 18 | < 0.1% |
| -4 | 49 | 0.1% |
| -3 | 59 | 0.1% |
| -2 | 71 | 0.1% |
| -1 | 101 | 0.2% |
| 0 | 626 | |
| 1 | 668 | |
| 2 | 669 | |
| 3 | 848 | |
| 4 | 825 |
| Value | Count | Frequency (%) |
| 67 | 7 | < 0.1% |
| 66 | 12 | < 0.1% |
| 65 | 30 | 0.1% |
| 64 | 33 | 0.1% |
| 63 | 21 | < 0.1% |
| 62 | 279 | |
| 61 | 271 | |
| 60 | 259 | |
| 59 | 250 | |
| 58 | 282 |
| Distinct | 443 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 3498 |
| Missing (%) | 7.0% |
| Memory size | 390.8 KiB |
| 19 | 2622 |
|---|---|
| 15 | 2594 |
| 18 | 2570 |
| 16 | 2548 |
| 17 | 2545 |
| Other values (438) |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 1.772912993 |
| Min length | 1 |
Characters and Unicode
| Total characters | 82444 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 367 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 7 |
|---|---|
| 2nd row | 9 |
| 3rd row | 4 |
| 4th row | 5 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 19 | 2622 | 5.2% |
| 15 | 2594 | 5.2% |
| 18 | 2570 | 5.1% |
| 16 | 2548 | 5.1% |
| 17 | 2545 | 5.1% |
| 10 | 2517 | 5.0% |
| 12 | 2483 | 5.0% |
| 11 | 2440 | 4.9% |
| 20 | 2422 | 4.8% |
| 9 | 2365 | 4.7% |
| Other values (433) | 21396 | |
| (Missing) | 3498 | 7.0% |
Length
| Value | Count | Frequency (%) |
| 19 | 2707 | 5.8% |
| 15 | 2674 | 5.8% |
| 16 | 2637 | 5.7% |
| 17 | 2636 | 5.7% |
| 18 | 2631 | 5.7% |
| 10 | 2591 | 5.6% |
| 12 | 2563 | 5.5% |
| 20 | 2518 | 5.4% |
| 11 | 2504 | 5.4% |
| 9 | 2440 | 5.2% |
| Other values (398) | 20601 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 30118 | |
| 2 | 13020 | |
| 0 | 6032 | 7.3% |
| 9 | 5262 | 6.4% |
| 8 | 5260 | 6.4% |
| 5 | 4698 | 5.7% |
| 3 | 4317 | 5.2% |
| 7 | 4033 | 4.9% |
| 6 | 4015 | 4.9% |
| 4 | 3975 | 4.8% |
| Other values (2) | 1714 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 80730 | |
| Connector Punctuation | 1427 | 1.7% |
| Dash Punctuation | 287 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 30118 | |
| 2 | 13020 | |
| 0 | 6032 | 7.5% |
| 9 | 5262 | 6.5% |
| 8 | 5260 | 6.5% |
| 5 | 4698 | 5.8% |
| 3 | 4317 | 5.3% |
| 7 | 4033 | 5.0% |
| 6 | 4015 | 5.0% |
| 4 | 3975 | 4.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1427 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 287 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 82444 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 30118 | |
| 2 | 13020 | |
| 0 | 6032 | 7.3% |
| 9 | 5262 | 6.4% |
| 8 | 5260 | 6.4% |
| 5 | 4698 | 5.7% |
| 3 | 4317 | 5.2% |
| 7 | 4033 | 4.9% |
| 6 | 4015 | 4.9% |
| 4 | 3975 | 4.8% |
| Other values (2) | 1714 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 82444 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 30118 | |
| 2 | 13020 | |
| 0 | 6032 | 7.3% |
| 9 | 5262 | 6.4% |
| 8 | 5260 | 6.4% |
| 5 | 4698 | 5.7% |
| 3 | 4317 | 5.2% |
| 7 | 4033 | 4.9% |
| 6 | 4015 | 4.9% |
| 4 | 3975 | 4.8% |
| Other values (2) | 1714 | 2.1% |
| Distinct | 3927 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| _ | 1059 |
|---|---|
| 11.5 | 70 |
| 11.32 | 63 |
| 7.01 | 60 |
| 7.35 | 60 |
| Other values (3922) |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 4.69558 |
| Min length | 1 |
Characters and Unicode
| Total characters | 234779 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 695 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 11.27 |
|---|---|
| 2nd row | 13.27 |
| 3rd row | 12.27 |
| 4th row | 11.27 |
| 5th row | 5.42 |
Common Values
| Value | Count | Frequency (%) |
| _ | 1059 | 2.1% |
| 11.5 | 70 | 0.1% |
| 11.32 | 63 | 0.1% |
| 7.01 | 60 | 0.1% |
| 7.35 | 60 | 0.1% |
| 10.06 | 57 | 0.1% |
| 8.22 | 56 | 0.1% |
| 7.63 | 56 | 0.1% |
| 7.69 | 56 | 0.1% |
| 10.3 | 55 | 0.1% |
| Other values (3917) | 48408 |
Length
| Value | Count | Frequency (%) |
| 1059 | 2.1% | |
| 11.5 | 70 | 0.1% |
| 11.32 | 63 | 0.1% |
| 7.01 | 60 | 0.1% |
| 7.35 | 60 | 0.1% |
| 3.93 | 57 | 0.1% |
| 10.06 | 57 | 0.1% |
| 8.22 | 56 | 0.1% |
| 7.63 | 56 | 0.1% |
| 7.69 | 56 | 0.1% |
| Other values (3471) | 48406 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 48941 | |
| 1 | 34489 | |
| 9 | 23001 | |
| 0 | 19861 | |
| 2 | 18122 | 7.7% |
| 7 | 15298 | 6.5% |
| 8 | 15257 | 6.5% |
| 5 | 14815 | 6.3% |
| 6 | 14574 | 6.2% |
| 3 | 14319 | 6.1% |
| Other values (3) | 16102 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 183944 | |
| Other Punctuation | 48941 | 20.8% |
| Connector Punctuation | 1059 | 0.5% |
| Dash Punctuation | 835 | 0.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 34489 | |
| 9 | 23001 | |
| 0 | 19861 | |
| 2 | 18122 | |
| 7 | 15298 | |
| 8 | 15257 | |
| 5 | 14815 | |
| 6 | 14574 | |
| 3 | 14319 | |
| 4 | 14208 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 48941 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1059 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 835 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 234779 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 48941 | |
| 1 | 34489 | |
| 9 | 23001 | |
| 0 | 19861 | |
| 2 | 18122 | 7.7% |
| 7 | 15298 | 6.5% |
| 8 | 15257 | 6.5% |
| 5 | 14815 | 6.3% |
| 6 | 14574 | 6.2% |
| 3 | 14319 | 6.1% |
| Other values (3) | 16102 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 234779 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 48941 | |
| 1 | 34489 | |
| 9 | 23001 | |
| 0 | 19861 | |
| 2 | 18122 | 7.7% |
| 7 | 15298 | 6.5% |
| 8 | 15257 | 6.5% |
| 5 | 14815 | 6.3% |
| 6 | 14574 | 6.2% |
| 3 | 14319 | 6.1% |
| Other values (3) | 16102 | 6.9% |
| Distinct | 750 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 1035 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.08020014 |
| Minimum | 0 |
|---|---|
| Maximum | 2593 |
| Zeros | 1102 |
| Zeros (%) | 2.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 15 |
| Maximum | 2593 |
| Range | 2593 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 196.9841205 |
|---|---|
| Coefficient of variation (CV) | 6.548630646 |
| Kurtosis | 96.36985966 |
| Mean | 30.08020014 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 9.587172676 |
| Sum | 1472877 |
| Variance | 38802.74373 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 4709 | |
| 4 | 4402 | 8.8% |
| 6 | 4375 | 8.8% |
| 7 | 4295 | 8.6% |
| 8 | 3922 | 7.8% |
| 9 | 3523 | 7.0% |
| 3 | 3466 | 6.9% |
| 11 | 2996 | 6.0% |
| 10 | 2982 | 6.0% |
| 12 | 2585 | 5.2% |
| Other values (740) | 11710 |
| Value | Count | Frequency (%) |
| 0 | 1102 | 2.2% |
| 1 | 1747 | 3.5% |
| 2 | 2454 | |
| 3 | 3466 | |
| 4 | 4402 | |
| 5 | 4709 | |
| 6 | 4375 | |
| 7 | 4295 | |
| 8 | 3922 | |
| 9 | 3523 |
| Value | Count | Frequency (%) |
| 2593 | 1 | |
| 2592 | 1 | |
| 2588 | 1 | |
| 2586 | 1 | |
| 2583 | 1 | |
| 2576 | 1 | |
| 2575 | 1 | |
| 2574 | 1 | |
| 2570 | 1 | |
| 2567 | 1 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Standard | |
|---|---|
| Good | |
| _ | |
| Bad |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.6909 |
| Min length | 1 |
Characters and Unicode
| Total characters | 234545 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Good |
|---|---|
| 2nd row | Good |
| 3rd row | Good |
| 4th row | Good |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Standard | 18379 | |
| Good | 12260 | |
| _ | 9805 | |
| Bad | 9556 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| standard | 18379 | |
| good | 12260 | |
| 9805 | ||
| bad | 9556 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 58574 | |
| a | 46314 | |
| o | 24520 | |
| S | 18379 | 7.8% |
| t | 18379 | 7.8% |
| n | 18379 | 7.8% |
| r | 18379 | 7.8% |
| G | 12260 | 5.2% |
| _ | 9805 | 4.2% |
| B | 9556 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 184545 | |
| Uppercase Letter | 40195 | 17.1% |
| Connector Punctuation | 9805 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 58574 | |
| a | 46314 | |
| o | 24520 | |
| t | 18379 | 10.0% |
| n | 18379 | 10.0% |
| r | 18379 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 18379 | |
| G | 12260 | |
| B | 9556 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9805 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 224740 | |
| Common | 9805 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 58574 | |
| a | 46314 | |
| o | 24520 | |
| S | 18379 | 8.2% |
| t | 18379 | 8.2% |
| n | 18379 | 8.2% |
| r | 18379 | 8.2% |
| G | 12260 | 5.5% |
| B | 9556 | 4.3% |
Common
| Value | Count | Frequency (%) |
| _ | 9805 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 234545 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 58574 | |
| a | 46314 | |
| o | 24520 | |
| S | 18379 | 7.8% |
| t | 18379 | 7.8% |
| n | 18379 | 7.8% |
| r | 18379 | 7.8% |
| G | 12260 | 5.2% |
| _ | 9805 | 4.2% |
| B | 9556 | 4.1% |
| Distinct | 12685 |
|---|---|
| Distinct (%) | 25.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 1109.03 | 12 |
|---|---|
| 1151.7 | 12 |
| 1360.45 | 12 |
| 460.46 | 12 |
| 1428.31 | 8 |
| Other values (12680) |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.43302 |
| Min length | 3 |
Characters and Unicode
| Total characters | 321651 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 473 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 809.98 |
|---|---|
| 2nd row | 809.98 |
| 3rd row | 809.98 |
| 4th row | 809.98 |
| 5th row | 605.03 |
Common Values
| Value | Count | Frequency (%) |
| 1109.03 | 12 | < 0.1% |
| 1151.7 | 12 | < 0.1% |
| 1360.45 | 12 | < 0.1% |
| 460.46 | 12 | < 0.1% |
| 1428.31 | 8 | < 0.1% |
| 950.59 | 8 | < 0.1% |
| 1334.81 | 8 | < 0.1% |
| 2329.28 | 8 | < 0.1% |
| 952.39 | 8 | < 0.1% |
| 1812.46 | 8 | < 0.1% |
| Other values (12675) | 49904 |
Length
| Value | Count | Frequency (%) |
| 1109.03 | 12 | < 0.1% |
| 1360.45 | 12 | < 0.1% |
| 460.46 | 12 | < 0.1% |
| 1151.7 | 12 | < 0.1% |
| 796.88 | 8 | < 0.1% |
| 1434.18 | 8 | < 0.1% |
| 1421.95 | 8 | < 0.1% |
| 852.74 | 8 | < 0.1% |
| 1381.1 | 8 | < 0.1% |
| 1423.88 | 8 | < 0.1% |
| Other values (12193) | 49904 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 50000 | |
| 1 | 41784 | |
| 2 | 31968 | |
| 3 | 29420 | |
| 4 | 29176 | |
| 5 | 24732 | |
| 6 | 24456 | |
| 8 | 24004 | |
| 7 | 23832 | |
| 9 | 23572 | |
| Other values (2) | 18707 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 271160 | |
| Other Punctuation | 50000 | 15.5% |
| Connector Punctuation | 491 | 0.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 41784 | |
| 2 | 31968 | |
| 3 | 29420 | |
| 4 | 29176 | |
| 5 | 24732 | |
| 6 | 24456 | |
| 8 | 24004 | |
| 7 | 23832 | |
| 9 | 23572 | |
| 0 | 18216 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 50000 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 491 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 321651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 50000 | |
| 1 | 41784 | |
| 2 | 31968 | |
| 3 | 29420 | |
| 4 | 29176 | |
| 5 | 24732 | |
| 6 | 24456 | |
| 8 | 24004 | |
| 7 | 23832 | |
| 9 | 23572 | |
| Other values (2) | 18707 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 321651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 50000 | |
| 1 | 41784 | |
| 2 | 31968 | |
| 3 | 29420 | |
| 4 | 29176 | |
| 5 | 24732 | |
| 6 | 24456 | |
| 8 | 24004 | |
| 7 | 23832 | |
| 9 | 23572 | |
| Other values (2) | 18707 | 5.8% |
| Distinct | 50000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.27958112 |
| Minimum | 20.50965206 |
|---|---|
| Maximum | 48.54066309 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 20.50965206 |
|---|---|
| 5-th percentile | 24.27433854 |
| Q1 | 28.06104036 |
| median | 32.28038958 |
| Q3 | 36.46859096 |
| 95-th percentile | 40.24488238 |
| Maximum | 48.54066309 |
| Range | 28.03101103 |
| Interquartile range (IQR) | 8.407550602 |
Descriptive statistics
| Standard deviation | 5.106237733 |
|---|---|
| Coefficient of variation (CV) | 0.1581878561 |
| Kurtosis | -0.9494207268 |
| Mean | 32.27958112 |
| Median Absolute Deviation (MAD) | 4.201862883 |
| Skewness | 0.03759574349 |
| Sum | 1613979.056 |
| Variance | 26.07366378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.29138904 | 1 | < 0.1% |
| 31.50968122 | 1 | < 0.1% |
| 27.90120124 | 1 | < 0.1% |
| 28.66930586 | 1 | < 0.1% |
| 35.05934584 | 1 | < 0.1% |
| 33.14716532 | 1 | < 0.1% |
| 39.76709718 | 1 | < 0.1% |
| 37.46144502 | 1 | < 0.1% |
| 41.83887758 | 1 | < 0.1% |
| 24.25796881 | 1 | < 0.1% |
| Other values (49990) | 49990 |
| Value | Count | Frequency (%) |
| 20.50965206 | 1 | |
| 20.62001732 | 1 | |
| 20.73922549 | 1 | |
| 20.80058685 | 1 | |
| 20.83922638 | 1 | |
| 20.91964798 | 1 | |
| 21.11966911 | 1 | |
| 21.14020193 | 1 | |
| 21.18158151 | 1 | |
| 21.18710526 | 1 |
| Value | Count | Frequency (%) |
| 48.54066309 | 1 | |
| 48.22871401 | 1 | |
| 48.15277749 | 1 | |
| 48.09645727 | 1 | |
| 48.06528066 | 1 | |
| 47.28898726 | 1 | |
| 47.23010359 | 1 | |
| 47.16317245 | 1 | |
| 46.97777638 | 1 | |
| 46.94753325 | 1 |
| Distinct | 399 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 4470 |
| Missing (%) | 8.9% |
| Memory size | 390.8 KiB |
| 16 Years and 1 Months | 254 |
|---|---|
| 20 Years and 1 Months | 254 |
| 18 Years and 7 Months | 252 |
| 19 Years and 7 Months | 252 |
| 18 Years and 6 Months | 250 |
| Other values (394) |
Length
| Max length | 22 |
|---|---|
| Median length | 21 |
| Mean length | 20.97537887 |
| Min length | 20 |
Characters and Unicode
| Total characters | 955009 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 22 Years and 9 Months |
|---|---|
| 2nd row | 22 Years and 10 Months |
| 3rd row | 23 Years and 0 Months |
| 4th row | 27 Years and 3 Months |
| 5th row | 27 Years and 4 Months |
Common Values
| Value | Count | Frequency (%) |
| 16 Years and 1 Months | 254 | 0.5% |
| 20 Years and 1 Months | 254 | 0.5% |
| 18 Years and 7 Months | 252 | 0.5% |
| 19 Years and 7 Months | 252 | 0.5% |
| 18 Years and 6 Months | 250 | 0.5% |
| 16 Years and 6 Months | 248 | 0.5% |
| 19 Years and 1 Months | 242 | 0.5% |
| 18 Years and 1 Months | 241 | 0.5% |
| 16 Years and 7 Months | 238 | 0.5% |
| 20 Years and 0 Months | 236 | 0.5% |
| Other values (389) | 43063 | |
| (Missing) | 4470 | 8.9% |
Length
| Value | Count | Frequency (%) |
| and | 45530 | |
| months | 45530 | |
| years | 45530 | |
| 6 | 6000 | 2.6% |
| 7 | 5955 | 2.6% |
| 1 | 4948 | 2.2% |
| 8 | 4885 | 2.1% |
| 9 | 4857 | 2.1% |
| 10 | 4766 | 2.1% |
| 11 | 4651 | 2.0% |
| Other values (28) | 54998 |
Most occurring characters
| Value | Count | Frequency (%) |
| 182120 | ||
| a | 91060 | |
| s | 91060 | |
| n | 91060 | |
| o | 45530 | 4.8% |
| t | 45530 | 4.8% |
| Y | 45530 | 4.8% |
| e | 45530 | 4.8% |
| r | 45530 | 4.8% |
| d | 45530 | 4.8% |
| Other values (12) | 226529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 546360 | |
| Space Separator | 182120 | 19.1% |
| Decimal Number | 135469 | 14.2% |
| Uppercase Letter | 91060 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 36744 | |
| 2 | 22565 | |
| 3 | 13607 | 10.0% |
| 0 | 12936 | 9.5% |
| 6 | 9805 | 7.2% |
| 7 | 9479 | 7.0% |
| 8 | 8668 | 6.4% |
| 9 | 8615 | 6.4% |
| 4 | 6716 | 5.0% |
| 5 | 6334 | 4.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 91060 | |
| s | 91060 | |
| n | 91060 | |
| o | 45530 | |
| t | 45530 | |
| e | 45530 | |
| r | 45530 | |
| d | 45530 | |
| h | 45530 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 45530 | |
| M | 45530 |
Space Separator
| Value | Count | Frequency (%) |
| 182120 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 637420 | |
| Common | 317589 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 182120 | ||
| 1 | 36744 | 11.6% |
| 2 | 22565 | 7.1% |
| 3 | 13607 | 4.3% |
| 0 | 12936 | 4.1% |
| 6 | 9805 | 3.1% |
| 7 | 9479 | 3.0% |
| 8 | 8668 | 2.7% |
| 9 | 8615 | 2.7% |
| 4 | 6716 | 2.1% |
Latin
| Value | Count | Frequency (%) |
| a | 91060 | |
| s | 91060 | |
| n | 91060 | |
| o | 45530 | |
| t | 45530 | |
| Y | 45530 | |
| e | 45530 | |
| r | 45530 | |
| d | 45530 | |
| M | 45530 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 955009 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 182120 | ||
| a | 91060 | |
| s | 91060 | |
| n | 91060 | |
| o | 45530 | 4.8% |
| t | 45530 | 4.8% |
| Y | 45530 | 4.8% |
| e | 45530 | 4.8% |
| r | 45530 | 4.8% |
| d | 45530 | 4.8% |
| Other values (12) | 226529 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Yes | |
|---|---|
| No | |
| NM |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.52316 |
| Min length | 2 |
Characters and Unicode
| Total characters | 126158 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| Yes | 26158 | |
| No | 17849 | |
| NM | 5993 | 12.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| yes | 26158 | |
| no | 17849 | |
| nm | 5993 | 12.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 26158 | |
| e | 26158 | |
| s | 26158 | |
| N | 23842 | |
| o | 17849 | |
| M | 5993 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70165 | |
| Uppercase Letter | 55993 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 26158 | |
| N | 23842 | |
| M | 5993 | 10.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26158 | |
| s | 26158 | |
| o | 17849 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 126158 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 26158 | |
| e | 26158 | |
| s | 26158 | |
| N | 23842 | |
| o | 17849 | |
| M | 5993 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 126158 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 26158 | |
| e | 26158 | |
| s | 26158 | |
| N | 23842 | |
| o | 17849 | |
| M | 5993 | 4.8% |
| Distinct | 13144 |
|---|---|
| Distinct (%) | 26.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1491.304305 |
| Minimum | 0 |
|---|---|
| Maximum | 82398 |
| Zeros | 5002 |
| Zeros (%) | 10.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 32.22238767 |
| median | 74.73334891 |
| Q3 | 176.1574914 |
| 95-th percentile | 683.4115255 |
| Maximum | 82398 |
| Range | 82398 |
| Interquartile range (IQR) | 143.9351037 |
Descriptive statistics
| Standard deviation | 8595.647887 |
|---|---|
| Coefficient of variation (CV) | 5.763845687 |
| Kurtosis | 49.80255452 |
| Mean | 1491.304305 |
| Median Absolute Deviation (MAD) | 55.07584522 |
| Skewness | 6.946275256 |
| Sum | 74565215.26 |
| Variance | 73885162.59 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5002 | 10.0% |
| 54.70127741 | 4 | < 0.1% |
| 93.60161743 | 4 | < 0.1% |
| 96.2655719 | 4 | < 0.1% |
| 92.42547138 | 4 | < 0.1% |
| 244.4200738 | 4 | < 0.1% |
| 104.7461774 | 4 | < 0.1% |
| 69.3798369 | 4 | < 0.1% |
| 45.17437938 | 4 | < 0.1% |
| 85.88604716 | 4 | < 0.1% |
| Other values (13134) | 44962 |
| Value | Count | Frequency (%) |
| 0 | 5002 | |
| 4.462837467 | 4 | < 0.1% |
| 4.713183572 | 4 | < 0.1% |
| 4.865689677 | 4 | < 0.1% |
| 4.916138542 | 4 | < 0.1% |
| 5.138484696 | 4 | < 0.1% |
| 5.218466359 | 4 | < 0.1% |
| 5.24927327 | 4 | < 0.1% |
| 5.262291048 | 4 | < 0.1% |
| 5.351086151 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 82398 | 1 | |
| 82347 | 1 | |
| 82316 | 1 | |
| 82248 | 1 | |
| 82235 | 1 | |
| 82225 | 1 | |
| 82091 | 1 | |
| 82071 | 1 | |
| 82023 | 1 | |
| 82016 | 1 |
| Distinct | 45450 |
|---|---|
| Distinct (%) | 95.2% |
| Missing | 2271 |
| Missing (%) | 4.5% |
| Memory size | 390.8 KiB |
| __10000__ | 2175 |
|---|---|
| 0.0 | 106 |
| 146.45223558174197 | 1 |
| 124.69037914425093 | 1 |
| 263.87560234624675 | 1 |
| Other values (45445) |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 16.95711203 |
| Min length | 3 |
Characters and Unicode
| Total characters | 809346 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 45448 ? |
|---|---|
| Unique (%) | 95.2% |
Sample
| 1st row | 236.64268203272135 |
|---|---|
| 2nd row | 21.465380264657146 |
| 3rd row | 148.23393788500925 |
| 4th row | 39.08251089460281 |
| 5th row | 39.684018417945296 |
Common Values
| Value | Count | Frequency (%) |
| __10000__ | 2175 | 4.3% |
| 0.0 | 106 | 0.2% |
| 146.45223558174197 | 1 | < 0.1% |
| 124.69037914425093 | 1 | < 0.1% |
| 263.87560234624675 | 1 | < 0.1% |
| 36.367528830515006 | 1 | < 0.1% |
| 82.41238403428513 | 1 | < 0.1% |
| 437.963686580976 | 1 | < 0.1% |
| 34.19467092596217 | 1 | < 0.1% |
| 101.72654992407084 | 1 | < 0.1% |
| Other values (45440) | 45440 | |
| (Missing) | 2271 | 4.5% |
Length
| Value | Count | Frequency (%) |
| 10000 | 2175 | 4.6% |
| 0.0 | 106 | 0.2% |
| 80.47799072248316 | 1 | < 0.1% |
| 47.77825691895126 | 1 | < 0.1% |
| 428.2140784268283 | 1 | < 0.1% |
| 220.2312521677813 | 1 | < 0.1% |
| 416.6748620847473 | 1 | < 0.1% |
| 72.90584412338201 | 1 | < 0.1% |
| 438.31259341087883 | 1 | < 0.1% |
| 58.49694106863918 | 1 | < 0.1% |
| Other values (45440) | 45440 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 86365 | |
| 2 | 77857 | |
| 4 | 76027 | |
| 3 | 75728 | |
| 0 | 75606 | |
| 6 | 74699 | |
| 5 | 74363 | |
| 8 | 72540 | |
| 7 | 72532 | |
| 9 | 69375 | |
| Other values (2) | 54254 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 755092 | |
| Other Punctuation | 45554 | 5.6% |
| Connector Punctuation | 8700 | 1.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 86365 | |
| 2 | 77857 | |
| 4 | 76027 | |
| 3 | 75728 | |
| 0 | 75606 | |
| 6 | 74699 | |
| 5 | 74363 | |
| 8 | 72540 | |
| 7 | 72532 | |
| 9 | 69375 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 45554 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8700 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 809346 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 86365 | |
| 2 | 77857 | |
| 4 | 76027 | |
| 3 | 75728 | |
| 0 | 75606 | |
| 6 | 74699 | |
| 5 | 74363 | |
| 8 | 72540 | |
| 7 | 72532 | |
| 9 | 69375 | |
| Other values (2) | 54254 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 809346 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 86365 | |
| 2 | 77857 | |
| 4 | 76027 | |
| 3 | 75728 | |
| 0 | 75606 | |
| 6 | 74699 | |
| 5 | 74363 | |
| 8 | 72540 | |
| 7 | 72532 | |
| 9 | 69375 | |
| Other values (2) | 54254 |
Payment_Behaviour
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Low_spent_Small_value_payments | |
|---|---|
| High_spent_Medium_value_payments | |
| High_spent_Large_value_payments | |
| Low_spent_Medium_value_payments | |
| High_spent_Small_value_payments | |
| Other values (2) |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 28.91952 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1445976 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Low_spent_Small_value_payments |
|---|---|
| 2nd row | High_spent_Medium_value_payments |
| 3rd row | Low_spent_Medium_value_payments |
| 4th row | High_spent_Medium_value_payments |
| 5th row | High_spent_Large_value_payments |
Common Values
| Value | Count | Frequency (%) |
| Low_spent_Small_value_payments | 12694 | |
| High_spent_Medium_value_payments | 8922 | |
| High_spent_Large_value_payments | 6844 | |
| Low_spent_Medium_value_payments | 6837 | |
| High_spent_Small_value_payments | 5651 | |
| Low_spent_Large_value_payments | 5252 | |
| !@9#%8 | 3800 | 7.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| low_spent_small_value_payments | 12694 | |
| high_spent_medium_value_payments | 8922 | |
| high_spent_large_value_payments | 6844 | |
| low_spent_medium_value_payments | 6837 | |
| high_spent_small_value_payments | 5651 | |
| low_spent_large_value_payments | 5252 | |
| 9#%8 | 3800 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 184800 | |
| e | 166455 | |
| a | 122841 | 8.5% |
| s | 92400 | 6.4% |
| p | 92400 | 6.4% |
| n | 92400 | 6.4% |
| t | 92400 | 6.4% |
| l | 82890 | 5.7% |
| m | 80304 | 5.6% |
| u | 61959 | 4.3% |
| Other values (19) | 377127 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1145976 | |
| Connector Punctuation | 184800 | 12.8% |
| Uppercase Letter | 92400 | 6.4% |
| Other Punctuation | 15200 | 1.1% |
| Decimal Number | 7600 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 166455 | |
| a | 122841 | |
| s | 92400 | |
| p | 92400 | |
| n | 92400 | |
| t | 92400 | |
| l | 82890 | 7.2% |
| m | 80304 | 7.0% |
| u | 61959 | 5.4% |
| v | 46200 | 4.0% |
| Other values (8) | 215727 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 36879 | |
| H | 21417 | |
| S | 18345 | |
| M | 15759 |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 3800 | |
| @ | 3800 | |
| # | 3800 | |
| % | 3800 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 3800 | |
| 8 | 3800 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 184800 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1238376 | |
| Common | 207600 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 166455 | |
| a | 122841 | 9.9% |
| s | 92400 | 7.5% |
| p | 92400 | 7.5% |
| n | 92400 | 7.5% |
| t | 92400 | 7.5% |
| l | 82890 | 6.7% |
| m | 80304 | 6.5% |
| u | 61959 | 5.0% |
| v | 46200 | 3.7% |
| Other values (12) | 308127 |
Common
| Value | Count | Frequency (%) |
| _ | 184800 | |
| ! | 3800 | 1.8% |
| @ | 3800 | 1.8% |
| 9 | 3800 | 1.8% |
| # | 3800 | 1.8% |
| % | 3800 | 1.8% |
| 8 | 3800 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1445976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 184800 | |
| e | 166455 | |
| a | 122841 | 8.5% |
| s | 92400 | 6.4% |
| p | 92400 | 6.4% |
| n | 92400 | 6.4% |
| t | 92400 | 6.4% |
| l | 82890 | 5.7% |
| m | 80304 | 5.6% |
| u | 61959 | 4.3% |
| Other values (19) | 377127 |
| Distinct | 49433 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 562 |
| Missing (%) | 1.1% |
| Memory size | 390.8 KiB |
| __-333333333333333333333333333__ | 6 |
|---|---|
| 329.8161038704352 | 1 |
| 403.8475175077663 | 1 |
| 382.1571836917757 | 1 |
| 203.42087912420956 | 1 |
| Other values (49428) |
Length
| Max length | 32 |
|---|---|
| Median length | 17 |
| Mean length | 17.34234799 |
| Min length | 13 |
Characters and Unicode
| Total characters | 857371 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 49432 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | 186.26670208571772 |
|---|---|
| 2nd row | 361.44400385378196 |
| 3rd row | 264.67544623342997 |
| 4th row | 343.82687322383634 |
| 5th row | 485.2984336755923 |
Common Values
| Value | Count | Frequency (%) |
| __-333333333333333333333333333__ | 6 | < 0.1% |
| 329.8161038704352 | 1 | < 0.1% |
| 403.8475175077663 | 1 | < 0.1% |
| 382.1571836917757 | 1 | < 0.1% |
| 203.42087912420956 | 1 | < 0.1% |
| 300.7943892000848 | 1 | < 0.1% |
| 391.68606982234553 | 1 | < 0.1% |
| 728.4641774729828 | 1 | < 0.1% |
| 265.7679202700299 | 1 | < 0.1% |
| 500.19846383311386 | 1 | < 0.1% |
| Other values (49423) | 49423 | |
| (Missing) | 562 | 1.1% |
Length
| Value | Count | Frequency (%) |
| 333333333333333333333333333 | 6 | < 0.1% |
| 385.3004381161553 | 1 | < 0.1% |
| 300.5226866460017 | 1 | < 0.1% |
| 459.94881480995565 | 1 | < 0.1% |
| 314.86461341416197 | 1 | < 0.1% |
| 293.7475635901477 | 1 | < 0.1% |
| 520.0915175778559 | 1 | < 0.1% |
| 260.4064883149828 | 1 | < 0.1% |
| 388.7936705325999 | 1 | < 0.1% |
| 595.220959147716 | 1 | < 0.1% |
| Other values (49423) | 49423 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 90947 | |
| 2 | 90118 | |
| 4 | 84677 | |
| 5 | 81313 | |
| 6 | 80808 | |
| 7 | 78501 | |
| 1 | 77986 | |
| 8 | 77928 | |
| 9 | 74663 | |
| 0 | 70968 | |
| Other values (3) | 49462 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 807909 | |
| Other Punctuation | 49432 | 5.8% |
| Connector Punctuation | 24 | < 0.1% |
| Dash Punctuation | 6 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 90947 | |
| 2 | 90118 | |
| 4 | 84677 | |
| 5 | 81313 | |
| 6 | 80808 | |
| 7 | 78501 | |
| 1 | 77986 | |
| 8 | 77928 | |
| 9 | 74663 | |
| 0 | 70968 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 49432 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 24 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 857371 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 90947 | |
| 2 | 90118 | |
| 4 | 84677 | |
| 5 | 81313 | |
| 6 | 80808 | |
| 7 | 78501 | |
| 1 | 77986 | |
| 8 | 77928 | |
| 9 | 74663 | |
| 0 | 70968 | |
| Other values (3) | 49462 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 857371 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 90947 | |
| 2 | 90118 | |
| 4 | 84677 | |
| 5 | 81313 | |
| 6 | 80808 | |
| 7 | 78501 | |
| 1 | 77986 | |
| 8 | 77928 | |
| 9 | 74663 | |
| 0 | 70968 | |
| Other values (3) | 49462 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| ID | Customer_ID | Month | Name | Age | SSN | Occupation | Annual_Income | Monthly_Inhand_Salary | Num_Bank_Accounts | Num_Credit_Card | Interest_Rate | Num_of_Loan | Type_of_Loan | Delay_from_due_date | Num_of_Delayed_Payment | Changed_Credit_Limit | Num_Credit_Inquiries | Credit_Mix | Outstanding_Debt | Credit_Utilization_Ratio | Credit_History_Age | Payment_of_Min_Amount | Total_EMI_per_month | Amount_invested_monthly | Payment_Behaviour | Monthly_Balance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0x160a | CUS_0xd40 | September | Aaron Maashoh | 23 | 821-00-0265 | Scientist | 19114.12 | 1824.843333 | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | 3 | 7 | 11.27 | 2022.0 | Good | 809.98 | 35.030402 | 22 Years and 9 Months | No | 49.574949 | 236.64268203272135 | Low_spent_Small_value_payments | 186.26670208571772 |
| 1 | 0x160b | CUS_0xd40 | October | Aaron Maashoh | 24 | 821-00-0265 | Scientist | 19114.12 | 1824.843333 | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | 3 | 9 | 13.27 | 4.0 | Good | 809.98 | 33.053114 | 22 Years and 10 Months | No | 49.574949 | 21.465380264657146 | High_spent_Medium_value_payments | 361.44400385378196 |
| 2 | 0x160c | CUS_0xd40 | November | Aaron Maashoh | 24 | 821-00-0265 | Scientist | 19114.12 | 1824.843333 | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | -1 | 4 | 12.27 | 4.0 | Good | 809.98 | 33.811894 | NaN | No | 49.574949 | 148.23393788500925 | Low_spent_Medium_value_payments | 264.67544623342997 |
| 3 | 0x160d | CUS_0xd40 | December | Aaron Maashoh | 24_ | 821-00-0265 | Scientist | 19114.12 | NaN | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | 4 | 5 | 11.27 | 4.0 | Good | 809.98 | 32.430559 | 23 Years and 0 Months | No | 49.574949 | 39.08251089460281 | High_spent_Medium_value_payments | 343.82687322383634 |
| 4 | 0x1616 | CUS_0x21b1 | September | Rick Rothackerj | 28 | 004-07-5839 | _______ | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | 1 | 5.42 | 5.0 | Good | 605.03 | 25.926822 | 27 Years and 3 Months | No | 18.816215 | 39.684018417945296 | High_spent_Large_value_payments | 485.2984336755923 |
| 5 | 0x1617 | CUS_0x21b1 | October | Rick Rothackerj | 28 | #F%$D@*&8 | Teacher | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | 3 | 5.42 | 5.0 | Good | 605.03 | 30.116600 | 27 Years and 4 Months | No | 18.816215 | 251.62736875017606 | Low_spent_Large_value_payments | 303.3550833433617 |
| 6 | 0x1618 | CUS_0x21b1 | November | Rick Rothackerj | 28 | 004-07-5839 | Teacher | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | NaN | 5.42 | 5.0 | _ | 605.03 | 30.996424 | 27 Years and 5 Months | No | 18.816215 | 72.68014533363515 | High_spent_Large_value_payments | 452.30230675990265 |
| 7 | 0x1619 | CUS_0x21b1 | December | Rick Rothackerj | 28 | 004-07-5839 | Teacher | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | 2_ | 7.42 | 5.0 | _ | 605.03 | 33.875167 | 27 Years and 6 Months | No | 18.816215 | 153.53448761392985 | !@9#%8 | 421.44796447960783 |
| 8 | 0x1622 | CUS_0x2dbc | September | Langep | 35 | 486-85-3974 | Engineer | 143162.64 | NaN | 1 | 5 | 8 | 3 | Auto Loan, Auto Loan, and Not Specified | 8 | 1942 | 7.1 | 3.0 | Good | 1303.01 | 35.229707 | 18 Years and 5 Months | No | 246.992319 | 397.50365354404653 | Low_spent_Medium_value_payments | 854.2260270022115 |
| 9 | 0x1623 | CUS_0x2dbc | October | Langep | 35 | 486-85-3974 | Engineer | 143162.64 | 12187.220000 | 1 | 5 | 8 | 3 | Auto Loan, Auto Loan, and Not Specified | 6 | 3 | 2.1 | 3.0 | Good | 1303.01 | 35.685836 | 18 Years and 6 Months | No | 246.992319 | 453.6151305781054 | Low_spent_Large_value_payments | 788.1145499681528 |
Last rows
| ID | Customer_ID | Month | Name | Age | SSN | Occupation | Annual_Income | Monthly_Inhand_Salary | Num_Bank_Accounts | Num_Credit_Card | Interest_Rate | Num_of_Loan | Type_of_Loan | Delay_from_due_date | Num_of_Delayed_Payment | Changed_Credit_Limit | Num_Credit_Inquiries | Credit_Mix | Outstanding_Debt | Credit_Utilization_Ratio | Credit_History_Age | Payment_of_Min_Amount | Total_EMI_per_month | Amount_invested_monthly | Payment_Behaviour | Monthly_Balance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 49990 | 0x25fd8 | CUS_0xaf61 | November | Chris Wickhamm | 50 | 133-16-7738 | Writer | 37188.1 | 3097.008333 | 1 | 4 | 4252 | 3 | Home Equity Loan, Mortgage Loan, and Student Loan | 7 | 12 | 5.38 | 3.0 | Good | 620.64 | 25.708414 | 30 Years and 7 Months | No | 84.205949 | 183.3656280777276 | Low_spent_Large_value_payments | 312.1292558307615 |
| 49991 | 0x25fd9 | CUS_0xaf61 | December | Chris Wickhamm | 50_ | 133-16-7738 | Writer | 37188.1 | 3097.008333 | 1 | 4 | 5 | 3 | Home Equity Loan, Mortgage Loan, and Student Loan | 3 | 12 | 5.38 | 3.0 | _ | 620.64 | 36.498383 | 30 Years and 8 Months | No | 33013.000000 | 238.3993828976901 | Low_spent_Large_value_payments | 257.095501010799 |
| 49992 | 0x25fe2 | CUS_0x8600 | September | Sarah McBridec | 29 | 031-35-0942 | Architect | 20002.88 | 1929.906667 | 10 | 8 | 29 | 5 | Personal Loan, Auto Loan, Mortgage Loan, Student Loan, and Student Loan | 33 | 25 | 18.31 | 9.0 | Bad | 3571.7 | 32.391288 | 6 Years and 4 Months | Yes | 60.964772 | 107.21074164760236 | Low_spent_Small_value_payments | 314.8151526456419 |
| 49993 | 0x25fe3 | CUS_0x8600 | October | Sarah McBridec | 29 | 031-35-0942 | Architect | 20002.88 | 1929.906667 | 10 | 8 | 29 | 5 | Personal Loan, Auto Loan, Mortgage Loan, Student Loan, and Student Loan | 33 | 25 | 18.31 | 12.0 | Bad | 3571.7 | 37.528511 | 6 Years and 5 Months | Yes | 60.964772 | 71.79442082882734 | Low_spent_Small_value_payments | 350.23147346441687 |
| 49994 | 0x25fe4 | CUS_0x8600 | November | Sarah McBridec | 29 | 031-35-0942 | _______ | 20002.88 | 1929.906667 | 10 | 8 | 29 | 5 | Personal Loan, Auto Loan, Mortgage Loan, Student Loan, and Student Loan | 33 | 22 | 18.31 | 12.0 | Bad | 3571.7 | 27.027812 | 6 Years and 6 Months | Yes | 60.964772 | 50.84684680498023 | High_spent_Small_value_payments | 341.179047488264 |
| 49995 | 0x25fe5 | CUS_0x8600 | December | Sarah McBridec | 4975 | 031-35-0942 | Architect | 20002.88 | 1929.906667 | 10 | 8 | 29 | 5 | Personal Loan, Auto Loan, Mortgage Loan, Student Loan, and Student Loan | 33 | 25 | 18.31 | 12.0 | _ | 3571.7 | 34.780553 | NaN | Yes | 60.964772 | 146.48632477751087 | Low_spent_Small_value_payments | 275.53956951573343 |
| 49996 | 0x25fee | CUS_0x942c | September | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | NaN | 4 | 6 | 7 | 2_ | Auto Loan, and Student Loan | 20 | NaN | 11.5 | 7.0 | Good | 502.38 | 27.758522 | 31 Years and 11 Months | NM | 35.104023 | 181.44299902757518 | Low_spent_Small_value_payments | 409.39456169535066 |
| 49997 | 0x25fef | CUS_0x942c | October | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 23 | 5 | 13.5 | 7.0 | Good | 502.38 | 36.858542 | 32 Years and 0 Months | No | 35.104023 | __10000__ | Low_spent_Large_value_payments | 349.7263321025098 |
| 49998 | 0x25ff0 | CUS_0x942c | November | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | NaN | 4 | 6 | 7 | 2_ | Auto Loan, and Student Loan | 21 | 6_ | 11.5 | 7.0 | Good | 502.38 | 39.139840 | 32 Years and 1 Months | No | 35.104023 | 97.59857973344877 | High_spent_Small_value_payments | 463.23898098947717 |
| 49999 | 0x25ff1 | CUS_0x942c | December | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 22 | 5 | 11.5 | 7.0 | _ | 502.38 | 34.108530 | 32 Years and 2 Months | No | 35.104023 | 220.45787812168732 | Low_spent_Medium_value_payments | 360.37968260123847 |